Designing Committees of Models through Deliberate Weighting of Data Points
نویسندگان
چکیده
In the adaptive derivation of mathematical models from data, each data point should contribute with a weight reflecting the amount of confidence one has in it. When no additional information for data confidence is available, all the data points should be considered equal, and are also generally given the same weight. In the formation of committees of models, however, this is often not the case and the data points may exercise unequal, even random, influence over the committee formation. In this paper, a principled approach to committee design is presented. The construction of a committee design matrix is detailed through which each data point will contribute to the committee formation with a fixed weight, while contributing with different individual weights to the derivation of the different constituent models, thus encouraging model diversity whilst not biasing the committee inadvertently towards any particular data points. Not distinctly an algorithm, it is instead a framework within which several different committee approaches may be realised. Whereas the focus in the paper lies entirely on regression, the principles discussed extend readily to classification.
منابع مشابه
Designing reading tasks to maximise vocabulary learning
Most vocabulary learning should occur incidentally through listening and reading. This is one of the reasons why a substantial extensive reading program is an important part of an English course. Extensive reading requires the learners to do large quantities of reading using material that is at the right level for them. Vocabulary learning occurs through th...
متن کاملDesigning a Model for Implementing Energy Policies in the Oil and Gas Sector
By studying the complex process of energy policy formulation for the oil and gas sector in Iran, we notice that these policies are not fully implemented due to inefficiency of the executive model. In this paper, we use the qualitative research of data theory, to analyze the situation through a combination of semi-structured interviews and study of available data. We use a snowball (chain refer...
متن کاملRanking DMUs by ideal points in the presence of fuzzy and ordinal data
Envelopment Analysis (DEA) is a very eective method to evaluate the relative eciency of decision-making units (DMUs). DEA models divided all DMUs in two categories: ecient and inecientDMUs, and don't able to discriminant between ecient DMUs. On the other hand, the observedvalues of the input and output data in real-life problems are sometimes imprecise or vague, suchas interval data, ordinal da...
متن کاملComparing Three Regression Models for Reconstructing Groundwater Level Data (A Case Study)
The base for hydrology studies is accurate data. However, the gaps and shortage of sufficient data exist n the most hydrology data such as underground water data as the most important and cheapest water source, lack of data take places due to various reasons such as Inability to measure and faille to register statistics. Missing data or incorrect statistics, Therefore, estimating the missin...
متن کاملThe Informativeness of Reported Earnings and Characteristics of the Audit Committee
An information usefulness approach to decision making points out that only the information is regarded as useful that will bring valuable messages to investors and lead to stock price adjustments. This study examines the effectiveness of audit committees in improving earnings quality and informativeness, particularly among family-owned firms. Earnings informativeness was measured through the re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of Machine Learning Research
دوره 4 شماره
صفحات -
تاریخ انتشار 2003